Clustering gene expression profile data by selective shrinkage
Abstract
Clustering of gene expression profiles is a widely used approach for finding macroscopic data structure. A complication in such analyses is that not all genes are informative for forming clusters and different clusters might have different transcription regulation. Driven by these considerations, we present a novel two-stage clustering approach. The first stage identifies informative genes by adaptive variable selection using pseudo-samples modeled by a high-dimensional multigroup ANOVA model. Variables are selected using a rescaled spike and slab Bayesian hierarchical model having a special selective shrinkage property. The second stage uses output from the first stage for clustering. We demonstrate why selective shrinkage occurs, and by extension, why it is useful for the clustering paradigm. We analyze a human gene atlas expression dataset where the question of interest is to look for tissue-specific transcription regulation and investigate whether tissues can be grouped together due to similar genomic control. © 2008 Elsevier B.V. All rights reserved.
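The two-stage idea in the abstract (select informative genes first, then cluster on the selected genes only) can be illustrated with a small sketch. This is not the authors' method: the rescaled spike and slab model and the pseudo-sample construction are replaced here by a plain one-way ANOVA F statistic filter, the data are synthetic, and every function name (`f_statistic`, `select_genes`, `group_profiles`, `cluster_groups`) is hypothetical.

```python
import random
from statistics import mean


def f_statistic(values, labels):
    """One-way ANOVA F statistic for a single gene across tissue groups."""
    groups = {}
    for v, g in zip(values, labels):
        groups.setdefault(g, []).append(v)
    grand = mean(values)
    k, n = len(groups), len(values)
    between = sum(len(x) * (mean(x) - grand) ** 2 for x in groups.values()) / (k - 1)
    within = sum((v - mean(x)) ** 2 for x in groups.values() for v in x) / (n - k)
    return between / within if within > 0 else float("inf")


def select_genes(expr, labels, top):
    """Stage 1 stand-in (assumption): keep the `top` genes with the largest F."""
    scores = [f_statistic(row, labels) for row in expr]
    ranked = sorted(range(len(expr)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:top])


def group_profiles(expr, labels, genes):
    """Mean expression of each tissue over the selected genes only."""
    return {
        g: [mean([v for v, l in zip(expr[j], labels) if l == g]) for j in genes]
        for g in sorted(set(labels))
    }


def cluster_groups(profiles, n_clusters):
    """Stage 2: single-linkage agglomerative clustering of tissue profiles."""
    clusters = [[g] for g in profiles]

    def dist(a, b):
        # Smallest squared Euclidean distance between any two members.
        return min(
            sum((x - y) ** 2 for x, y in zip(profiles[p], profiles[q]))
            for p in a for q in b
        )

    while len(clusters) > n_clusters:
        pairs = [(i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))]
        i, j = min(pairs, key=lambda ij: dist(clusters[ij[0]], clusters[ij[1]]))
        clusters[i] = clusters[i] + clusters.pop(j)
    return [sorted(c) for c in clusters]


# Synthetic data (assumption): 4 tissues x 3 replicates, 20 genes.
# Only genes 0-3 carry signal; tissues A and B share one expression level,
# tissues C and D share another.
rng = random.Random(0)
labels = [t for t in "ABCD" for _ in range(3)]
signal = {"A": 10.0, "B": 10.0, "C": 0.0, "D": 0.0}
expr = [
    [(signal[l] if j < 4 else 0.0) + rng.gauss(0, 1) for l in labels]
    for j in range(20)
]

selected = select_genes(expr, labels, top=4)
clusters = cluster_groups(group_profiles(expr, labels, selected), n_clusters=2)
print("selected genes:", selected)
print("tissue clusters:", clusters)
```

A hard top-`k` filter is the crudest possible substitute for selective shrinkage: it discards uninformative genes outright instead of shrinking their effects toward zero, but it shows why removing noise genes before the second stage makes the tissue grouping easier to recover.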
Similar resources
Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in the prevention, diagnosis and treatment of diseases at the genomic level. In this paper, fast global k-means (fast GKM) is developed for clustering gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
Clustering gene expression data using random forest dissimilarity
Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data typically involve a large number of variables (genes) compared with the number of samples (patients). Many clustering methods have been built on the dissimilarity among observations, calculated by a distance function. As increa...
Evaluation of β-actin as a Reference Gene for Comparative Expression Analysis of Equine Adipose- and Bone Marrow-Derived Mesenchymal Stem Cells by qRT-PCR
Background: Bone marrow and adipose tissue are two main sources of mesenchymal stem cells (MSCs). Some studies suggest that there are differences in the gene expression profiles of MSCs derived from various tissues. To investigate gene expression profiles by qRT-PCR, an appropriate reference gene with a stable expression level should be chosen for normalizing data. This study was designed to e...
Gene Expression Profile Analysis during Mouse Tooth Development
Introduction: Complex molecular pathways are involved in the development of different tissues such as teeth. Differential gene expression patterns during tooth development generate different tooth types. Tooth development results from interactions between the oral epithelium and underlying ectomesenchyme cells of neural crest origin. Tooth development is regulated by different signaling networks. In thi...
EFFECT OF AEROBIC TRAINING AND ETHANOL CONSUMPTION ON LIPID PROFILE AND GENE EXPRESSION OF SOME GASTROCNEMIUS MUSCLE MYOKINES IN MALE RATS
Background: Skeletal muscle, as an endocrine tissue, is involved in the regulation of metabolic activity and in the production and secretion of hormones, including myokines. The aim of the present study was to investigate the effect of eight weeks of aerobic training combined with ethanol consumption on plasma lipid profile and glucose levels, triglyceride content and myonectin, irisin and leptin gene expr...